Improved phase reconstruction in single-channel speech separation

نویسندگان

  • Florian Mayer
  • Pejman Mowlaee Begzade Mahale
چکیده

Conventional single-channel source separation (SCSS) algorithms are mostly focused on estimating the spectral amplitude of the underlying sources extracted from a mixture. The importance of phase information in source separation and its positive impact on improving the achievable performance is not adequately studied yet. In this work, we propose a phase estimation method to enhance the spectral phase of the underlying signals in SCSS framework. The proposed method relies on multi-pitch estimation and phase decomposition followed by applying temporal smoothing filters on the unwrapped mixture phase. We consider the combination of the proposed phase estimator with ideal binary mask and non-negative matrix factorization, as two well-known SCSS methods for separating the spectral amplitudes. Our results show that certain improvements in quality and intelligibility is achievable via replacing the mixture phase with the estimated one when reconstructing the sources.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phase estimation for signal reconstruction in single-channel speech separation

Single-channel speech separation algorithms frequently ignore the issue of accurate phase estimation while reconstructing the enhanced signal. Instead, they directly employ the mixed-signal phase for signal reconstruction which leads to undesired traces of the interfering source in the target signal. In this paper, assuming a given knowledge of signal spectrum amplitude, we present a solution t...

متن کامل

Phase estimation for signal reconstruction in single-channel source separation

Single-channel speech separation algorithms frequently ignore the issue of accurate phase estimation while reconstructing the enhanced signal. Instead, they directly employ the mixed-signal phase for signal reconstruction which leads to undesired traces of the interfering source in the target signal. In this paper, assuming a given knowledge of signal spectrum amplitude, we present a solution t...

متن کامل

Impact of phase estimation on single-channel speech separation based on time-frequency masking.

Time-frequency masking is a common solution for the single-channel source separation (SCSS) problem where the goal is to find a time-frequency mask that separates the underlying sources from an observed mixture. An estimated mask is then applied to the mixed signal to extract the desired signal. During signal reconstruction, the time-frequency-masked spectral amplitude is combined with the mixt...

متن کامل

Iterative sinusoidal-based partial phase reconstruction in single-channel source separation

Partial phase reconstruction based on a confidence domain has recently been shown to provide improved signal reconstruction performance in a single-channel source separation scenario. In this paper, we replace the previous binarized fixed-threshold confidence domain with a new signal-dependent one estimated by employing a sinusoidal model to be applied on the estimated magnitude spectrum of the...

متن کامل

Feature Space Reconstruction for Single-Channel Speech Separation

In this work we address the problem of separating multiple speakers from a single microphone recording. We formulate a linear regression model for estimating each speaker based on features derived from the mixture. The employed feature representation is a sparse, non-negative encoding of the speech mixture in terms of pre-learned speaker-dependent dictionaries. Previous work has shown that this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015